Concept
Data Science
Parents
Children
Algorithms, Big Data Analytics, Biomedical Data Science, Data Management, Data Mining
Publications: 663.6K
Citations: 46.3M
Authors: 947K
Institutions: 33.8K
Pattern and Relational Foundations
1951 - 1980
During this era, work on data organization and relational modeling established the backbone for later data science, enabling data independence and structured representations of data and similarity. Clustering and pattern discovery matured into central methodologies, with early unsupervised approaches and the beginnings of knowledge discovery in databases (KDD) shaping subsequent workflows. Measures of distance and proximity guided representation, retrieval, and evaluation, while pattern recognition and image analysis supplied benchmark problems for automated perception.
• Foundations for data organization and database theory underpin later data science, emphasizing relational models, data management, and semantic structuring of data, including representations of structure and similarity data [5], [11], [13], [18], [19].
• Clustering and pattern discovery matured as core data science methods: early unsupervised clustering approaches, model-based clustering, document clustering, and the beginnings of knowledge discovery in databases (KDD) [2], [3], [7], [8], [9], [16].
• Distances, proximities, and similarity measures organize data representations, guiding clustering and retrieval; multidimensional scaling with an unknown distance function, similarity data representations, and fuzzy-set validity measures shape evaluation [1], [4], [10], [12], [19].
• Pattern recognition and image analysis provide benchmark problems and algorithmic ideas for automated perception; this includes pattern classification, linear pattern matching algorithms, and scene analysis [4], [8], [20].
• Algorithmic infrastructure for scalable data processing includes efficient range-search data structures, inverted database structures, and data representation diagrams that support storage and querying [6], [13], [17], [18].
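As a rough illustration of the inverted structures mentioned in the last bullet, the sketch below builds a toy inverted index in Python and answers a conjunctive query against it. The function names `build_inverted_index` and `query_all` and the sample documents are assumptions made for this example; they are not drawn from the cited works, and whitespace tokenization stands in for whatever indexing scheme a real system would use.

```python
from collections import defaultdict


def build_inverted_index(docs):
    """Map each lowercase token to the set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in text.lower().split():
            index[token].add(doc_id)
    return index


def query_all(index, terms):
    """Return sorted ids of documents that contain every query term."""
    # index.get avoids creating empty entries for unknown terms.
    postings = [index.get(term.lower(), set()) for term in terms]
    if not postings:
        return []
    return sorted(set.intersection(*postings))


# Hypothetical sample documents, used only for this illustration.
docs = {
    1: "relational model of data",
    2: "cluster analysis of similarity data",
    3: "pattern classification and scene analysis",
}
index = build_inverted_index(docs)
print(query_all(index, ["analysis", "data"]))  # prints [2]
```

The point of the design is that a query touches only the posting sets of its own terms rather than scanning every document, which is what makes this kind of structure useful for storage and retrieval at scale.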
Foundations of Data Mining
1981 - 1996
Graph-Based Pattern Mining
1997 - 2003
Large-Scale Unified Representations
2004 - 2010
End-to-End Deep Vision
2011 - 2017
Self-Supervised Multimodal Representations
2018 - 2024